J.A. PEACOCK

Abstract. These lectures cover various aspects of the statistical description
of cosmological density fields. Observationally, this consists of the point
process defined by galaxies, and the challenge is to relate this to the
continuous density field generated by gravitational instability in dark
matter. The main topics discussed are (1) nonlinear structure in CDM models;
(2) statistical measures of clustering; (3) redshift-space distortions; (4)
small-scale clustering and bias. The overall message is optimistic, in that
simple assumptions for where galaxies should form in the mass density field
allow one to understand the systematic differences between galaxy data and
the predictions of CDM models.



1. Preamble

The subject of large-scale structure is in a period of very rapid development.
For many years, this term would have meant only one thing: the distribution
of galaxies. However, we are increasingly able to probe the primordial
fluctuations through the CMB, so that the problem of galaxy formation and
clustering is now only one aspect of the general picture of structure
formation. The rationale for studying the large-scale distribution of
galaxies is therefore altering. Ten years ago, we were happy to produce
samples based on a rather sparse random sampling of the galaxy distribution,
with the main aim of tying down statistics such as the large-scale power
spectrum of number-density fluctuations. A major goal of the subject remains
the measurement of the fluctuation spectrum for wavelengths > 100 Mpc, and
the demonstration that this agrees in shape with what can be inferred from
the CMB. Nevertheless, we are now increasingly interested in studying the
pattern of galaxies with the highest possible fidelity, which demands deep,
fully-sampled surveys of the local universe. Such studies will tell us much
about the processes by which galaxies formed and evolved within the
distribution of dark matter. The aim of these lectures is therefore to look
both backwards and forwards: reviewing the foundations of the subject and
looking forward to the future issues.



2. The CDM family album 

2.1. THE LINEAR SPECTRUM 

The basic picture of inflationary models (but also of cosmology before
inflation) is of a primordial power-law spectrum, written dimensionlessly as
the logarithmic contribution to the fractional density variance, $\sigma^2$:

$$\Delta^2(k) = \frac{d\sigma^2}{d\ln k} \propto k^{3+n}, \tag{1}$$

where $n$ stands for $n_s$ hereafter. This undergoes linear growth

$$\delta_k(a) = \delta_k(a_0) \left[\frac{D(a)}{D(a_0)}\right] T_k, \tag{2}$$

where the linear growth law is

$$D(a) = a\, g[\Omega(a)] \tag{3}$$

in the matter era, and the growth suppression for low $\Omega$ is

$$g(\Omega) \simeq \ldots\ \mathrm{(open)}; \qquad g(\Omega) \simeq \ldots\ \mathrm{(flat)}. \tag{4}$$

The transfer function depends on the dark-matter content as shown in 
figure 1. 

Note the baryonic oscillations in figure 1; these can be significant even in 
CDM-dominated models when working with high-precision data. Eisenstein
& Hu (1998) are to be congratulated for their impressive persistence in
finding an accurate fitting formula that describes these wiggles. This is 
invaluable for carrying out a search of a large parameter space. 

Figure 1. Transfer functions for various dark-matter models. The scaling with
$\Omega h^2$ is exact only for the zero-baryon models; the baryon results are
scaled from the particular case $\Omega_B = 1$, $h = 1/2$.

The state of the linear-theory spectrum after these modifications is
illustrated in figure 2. The primordial power-law spectrum is reduced at
large $k$, by an amount that depends on both the quantity of dark matter and
its nature. Generally the bend in the spectrum occurs near $1/k$ of order
the horizon size at matter-radiation equality, $\propto (\Omega h^2)^{-1}$.
For a pure CDM universe, with scale-invariant initial fluctuations ($n = 1$),
the observed spectrum depends only on two parameters. One is the shape
$\Gamma = \Omega h$, and the other is a normalization. On the shape front, a
government health warning is needed, as follows. It has been quite common to
take $\Gamma$-based fits to observations as indicating a measurement of
$\Omega h$, but there are three reasons why this may give incorrect answers:

(1) The dark matter may not be CDM. An admixture of HDM will damp the
spectrum more, mimicking a lower CDM density.

(2) Even in a CDM-dominated universe, baryons can have a significant effect,
making $\Gamma$ lower than $\Omega h$. An approximate formula for this is
given in figure 2 (Peacock & Dodds 1994; Sugiyama 1995).

(3) The strongest (and most-ignored) effect is tilt: if $n \neq 1$, then even
in a pure CDM universe a $\Gamma$-model fit to the spectrum will give a badly
incorrect estimate of the density (the change in $\Omega h$ is roughly
$0.3(n - 1)$; Peacock & Dodds 1994).
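
The size of effects (2) and (3) can be illustrated with a short numerical sketch. It assumes the approximate baryon formula annotated on figure 2, $\Gamma = \Omega h \exp[-\Omega_B(1 + \sqrt{2h}/\Omega)]$, and the rough tilt shift $0.3(n - 1)$ quoted above; the function names and the example parameter values are hypothetical, chosen only for illustration.

```python
import math


def gamma_shape(omega, h, omega_b=0.0):
    """Apparent shape parameter, Gamma = Omega h exp[-Omega_B (1 + sqrt(2h)/Omega)]."""
    return omega * h * math.exp(-omega_b * (1.0 + math.sqrt(2.0 * h) / omega))


def tilt_shift_in_omega_h(n):
    """Rough change in the fitted Omega h caused by tilt, about 0.3 (n - 1)."""
    return 0.3 * (n - 1.0)


if __name__ == "__main__":
    # Illustrative (hypothetical) parameter choices: Omega = 1, h = 0.5.
    for omega_b in (0.0, 0.05, 0.10):
        print(f"Omega_B = {omega_b:.2f}: Gamma = {gamma_shape(1.0, 0.5, omega_b):.3f}")
    print("shift in Omega h for n = 0.9:", tilt_shift_in_omega_h(0.9))
```

With zero baryons the function returns $\Gamma = \Omega h$ exactly, and any positive baryon fraction lowers $\Gamma$, which is the sense of the bias described in point (2).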

2.2. NORMALIZATION 

Figure 2. This figure illustrates how the primordial power spectrum is
modified as a function of density in a CDM model. For a given tilt, it is
always possible to choose a density that satisfies both the COBE and cluster
normalizations. [Figure annotations: Fourier decomposition of the density
field; dimensionless power $\Delta^2(k) = d\sigma^2/d\ln k \propto
k^3 |\delta_k|^2 \propto k^{3+n} T_k^2$ against wavenumber $k$, for $n > 1$
and $n = 1$; apparent density from the matter-radiation horizon,
$\Gamma = \Omega h \exp[-\Omega_B(1 + \sqrt{2h}/\Omega)]$; cluster
normalization $\sigma_8^2 = \Delta^2(k_{\rm eff})$, with
$k_{\rm eff}/h\,{\rm Mpc}^{-1} = 0.172 + 0.011\,[\ln(\Gamma/0.34)]^2$.]

The other parameter is the normalization. This can be set at a number of
points. The COBE normalization comes from large-angle CMB anisotropies, and
is sensitive to the power spectrum at $k \simeq 10^{-3}\,h\,{\rm Mpc}^{-1}$.
The alternative is to set the normalization near the quasilinear scale,
using the abundance of rich clusters. Many authors have tried this
calculation, and there is good agreement on the answer:

$$\sigma_8 \simeq (0.5 - 0.6)\, \Omega_m^{-0.56} \tag{5}$$

(White, Efstathiou & Frenk 1993; Eke et al. 1996; Viana & Liddle 1996). In
many ways, this is the most sensible normalization to use for LSS studies,
since it does not rely on an extrapolation from larger scales.

Figure 3. For 10% baryons, the value of $n$ needed to reconcile COBE and the
cluster normalization in CDM models.

Within the CDM model, it is always possible to satisfy both these
normalization constraints, by appropriate choice of $\Gamma$ and $n$. This is
illustrated in figure 3. Note that vacuum energy affects the answer; for
reasonable values of $h$ and reasonable baryon content, flat models require
$\Omega_m \simeq 0.3$, whereas open models require $\Omega_m \simeq 0.5$.

2.3. THE NONLINEAR SPECTRUM 

On smaller scales ($k > 0.1$), nonlinear effects become important. These are
relatively well understood so far as they affect the power spectrum of the
mass (e.g. Hamilton et al. 1991; Jain, Mo & White 1995; Peacock & Dodds
1996). Based on a fitting formula for the similarity solution governing the
evolution of scale-free initial conditions, it is possible to predict the
evolved spectrum in CDM universes to a few per cent precision (e.g. Jenkins
et al. 1998).

Figure 4. Baryonic fluctuations in the spectrum can become significant for
high-precision measurements. Although such features are much less important
in the density spectrum than in the CMB (first panel), the order 10%
modulation of the power is potentially detectable. However, nonlinear
evolution has the effect of damping all beyond the second peak. This second
feature is relatively narrow, and can serve as a clear proof of the past
existence of oscillations in the baryon-photon fluid (Meiksin, White &
Peacock 1999).

These methods can cope with most smoothly-varying power spectra, but they
break down for models with a large baryon content. Figure 1 shows that
rather large oscillatory features would be expected if the universe was
baryon dominated. The lack of observational evidence for such features is
one reason for believing that the universe might be dominated by
collisionless nonbaryonic matter (consistent with primordial nucleosynthesis if

Nevertheless, baryonic fluctuations in the spectrum can become significant
for high-precision measurements. Figure 4 shows that order 10% modulation of
the power may be expected in realistic baryonic models (Eisenstein & Hu
1998; Goldberg & Strauss 1998). Most of these features are however removed
by nonlinear evolution. The highest-$k$ feature to survive is usually the
second peak, which almost always lies near $k = 0.05\,{\rm Mpc}^{-1}$ (no
$h$, for a change). This feature is relatively narrow, and can serve as a
clear proof of the past existence of baryonic oscillations in forming the
mass distribution (Meiksin, White & Peacock 1999). However, figure 4
emphasizes that the easiest way of detecting the presence of baryons is
likely to be through the CMB spectrum. The oscillations have a much larger
'visibility' there, because the small-scale CMB anisotropies come directly
from the coupled radiation-baryon fluid, rather than the small-scale dark
matter perturbations.

3. Statistics 

Statistical measures of the cosmological density field relate to properties of 
the dimensionless density perturbation field 

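$$\delta(\mathbf{x}) \equiv \frac{\rho(\mathbf{x}) - \langle\rho\rangle}{\langle\rho\rangle}, \tag{6}$$

although $\delta$ need not be assumed to be small.
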
3.1. CORRELATION FUNCTIONS 

The simplest measure is the autocorrelation function of the density
perturbation

$$\xi_A(r) \equiv \langle \delta(\mathbf{x})\, \delta(\mathbf{x} + \mathbf{r}) \rangle. \tag{7}$$

This is a straightforward statistical measure that can also be computed for
the dark-matter distribution in $N$-body simulations. Formally, the
averaging operator here is an ensemble average, but one generally appeals to
the ergodic nature of the density field to replace this with a volume
average.

However, galaxies are a point process, so what astronomers can measure in
practice is the two-point correlation function, which gives the excess
probability for finding a neighbour a distance $r$ from a given galaxy. This
is defined through the probability of finding a pair with one object in each
of the volume elements $dV_1$ and $dV_2$:

$$dP = \rho_0^2\, [1 + \xi_2(r)]\, dV_1\, dV_2. \tag{8}$$

Is it true that $\xi_A(r) = \xi_2(r)$? Life would certainly be simple if so,
and much work on large-scale structure has implicitly assumed the Poisson
clustering hypothesis, in which galaxies are assumed to be sampled at random
from some continuous underlying density field. Many of the puzzles in the
field can however be traced to the fact that this hypothesis is probably
false, as discussed below.

A related quantity is the cross-correlation function. Here, one considers
two different classes of object ($a$ and $b$, say), and the
cross-correlation function $\xi_{ab}$ is defined as the (symmetric)
probability of finding a pair in which $dV_1$ is occupied by an object from
the first catalogue and $dV_2$ by one from the second. Both cross- and
auto-correlation functions are readily extended to higher orders and
considerations of $n$-tuples of points in a given geometry.

3.2. FOURIER SPACE 

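For the Fourier counterpart of this analysis, we assume that the field is
periodic within some box of side $L$, and expand as a Fourier series:

$$\delta(\mathbf{x}) = \sum_{\mathbf{k}} \delta_k\, e^{-i\mathbf{k}\cdot\mathbf{x}}. \tag{9}$$

For a real field, $\delta_k(-\mathbf{k}) = \delta_k^*(\mathbf{k})$. Using
this definition in the correlation function, most cross terms integrate to
zero through the periodic boundary conditions, giving

$$\xi(\mathbf{r}) = \sum_{\mathbf{k}} |\delta_k|^2\, e^{-i\mathbf{k}\cdot\mathbf{r}} = \frac{V}{(2\pi)^3} \int |\delta_k|^2\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3k, \tag{10}$$

where $V = L^3$. In short, the correlation function is the Fourier transform
of the power spectrum.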

We shall usually express the power spectrum in dimensionless form, as the
variance per $\ln k$ ($\Delta^2(k) = d\langle\delta^2\rangle / d\ln k
\propto k^3 P[k]$):

$$\Delta^2(k) \equiv \frac{V}{(2\pi)^3}\, 4\pi k^3 P(k) = \frac{2}{\pi}\, k^3 \int_0^\infty \xi(r)\, \frac{\sin kr}{kr}\, r^2\, dr. \tag{11}$$

This gives a more easily visualizable meaning to the power spectrum than
does the quantity $VP(k)$, which has dimensions of volume: $\Delta^2(k) = 1$
means that there are order-unity density fluctuations from modes in the
logarithmic bin around wavenumber $k$. $\Delta^2(k)$ is therefore the
natural choice for a Fourier-space counterpart to the dimensionless quantity
$\xi(r)$.

In the days before inflation, the primordial power spectrum was chosen by
hand, and the minimal assumption was a featureless power law:

$$\langle |\delta_k|^2 \rangle \equiv P(k) \propto k^n. \tag{12}$$

The index $n$ governs the balance between large- and small-scale power.
Similarly, a power-law spectrum implies a power-law correlation function. If
$\xi(r) = (r/r_0)^{-\gamma}$, with $\gamma = n + 3$, the corresponding 3D
power spectrum is

$$\Delta^2(k) = \frac{2}{\pi}\, (k r_0)^\gamma\, \Gamma(2 - \gamma)\, \sin\frac{(2 - \gamma)\pi}{2} \equiv \beta\, (k r_0)^\gamma \tag{13}$$

($= 0.903\,(k r_0)^{1.8}$ if $\gamma = 1.8$). This expression is only valid
for $n < 0$ ($\gamma < 3$); for larger values of $n$, $\xi$ must become
negative at large $r$ (because $P(0)$ must vanish, implying
$\int_0^\infty \xi(r)\, r^2\, dr = 0$). A cutoff in the spectrum at large
$k$ is needed to obtain physically sensible results.
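
The numerical coefficient quoted for $\gamma = 1.8$ can be checked directly. The sketch below simply evaluates the prefactor $(2/\pi)\,\Gamma(2 - \gamma)\,\sin[(2 - \gamma)\pi/2]$ with the standard-library gamma function; the function names and the example value of $r_0$ are illustrative choices, not part of the original text.

```python
import math


def beta_coefficient(gamma):
    """Prefactor (2/pi) Gamma(2 - gamma) sin[(2 - gamma) pi / 2] of the power-law relation.

    Intended for gamma < 3; gamma = 2 is avoided here because Gamma(0) is
    singular (the product with the sine stays finite only as a limit).
    """
    x = 2.0 - gamma
    return (2.0 / math.pi) * math.gamma(x) * math.sin(x * math.pi / 2.0)


def delta2_power_law(k, r0, gamma):
    """Dimensionless power Delta^2(k) = beta (k r0)^gamma for xi = (r/r0)^(-gamma)."""
    return beta_coefficient(gamma) * (k * r0) ** gamma


if __name__ == "__main__":
    print("beta(1.8) =", round(beta_coefficient(1.8), 4))
    # Hypothetical correlation length r0 = 5 (in the same length units as 1/k).
    print("Delta^2 at k r0 = 1:", delta2_power_law(0.2, 5.0, 1.8))
```

At $k r_0 = 1$ the dimensionless power equals the coefficient itself, so a correlation length $r_0$ marks roughly the scale at which the fluctuations become of order unity.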

The most interesting value of $n$ is the scale-invariant spectrum, $n = 1$,
i.e. $\Delta^2 \propto k^4$. To see how the name arises, consider a
perturbation $\delta\Phi$ in the gravitational potential:

$$\nabla^2 \delta\Phi = 4\pi G \rho_0\, \delta \quad\Rightarrow\quad \delta\Phi_k = -4\pi G \rho_0\, \delta_k / k^2. \tag{14}$$

The two powers of $k$ pulled down by $\nabla^2$ mean that, if
$\Delta^2 \propto k^4$ for the power spectrum of density fluctuations, then
$\Delta^2_\Phi$ is a constant. Since potential perturbations govern the
flatness of spacetime, this says that the scale-invariant spectrum
corresponds to a metric that is a fractal: spacetime has the same degree of
'wrinkliness' on each resolution scale. The total curvature fluctuations
diverge, but only logarithmically at either extreme of wavelength.

3.3. ERROR ESTIMATES 

A key question for these statistical measures is how accurate they are, i.e.
how much does the result for a given finite sample depart from the ideal
statistic averaged over an infinite universe? Terminology here can be
confusing, in that a distinction is sometimes made between sampling variance
and cosmic variance. The former is to be understood as arising from probing
a given volume only with a finite number of galaxies (e.g. just the bright
ones), so that $\sqrt{N}$ statistics limit our knowledge of the mass
distribution within that region. The second term concerns whether we have
reached a fair sample of the universe, and depends on whether there is
significant power in density perturbation modes with wavelengths larger than
the sample depth. Clearly, these two aspects are closely related.

The quantitative analysis of these errors is most simply performed in
Fourier space, and was given by Feldman, Kaiser & Peacock (1994). The
results can be understood most simply by comparison with an idealized
complete and uniform survey of a volume $L^3$, with periodicity scale $L$.
For an infinite survey, the arbitrariness of the spatial origin means that
different modes are uncorrelated:

$$\langle \delta_k(\mathbf{k}_i)\, \delta_k^*(\mathbf{k}_j) \rangle = P(\mathbf{k}_i)\, \delta_{ij}. \tag{15}$$

Each mode has an exponential distribution in power (because the complex
coefficients $\delta_k$ are 2D Gaussian-distributed variables on the Argand
plane), for which the mean and rms are identical. The fractional uncertainty
in the mean power measured over some $k$-space volume is then just
determined by the number of uncorrelated modes averaged over:

$$\frac{\delta \bar{P}}{\bar{P}} = \frac{1}{N_{\rm modes}^{1/2}}; \qquad N_{\rm modes} = \left(\frac{L}{2\pi}\right)^3 \int d^3k. \tag{16}$$

The only subtlety is that, because the density field is real, modes at
$\mathbf{k}$ and $-\mathbf{k}$ are perfectly correlated. Thus, if the
$k$-space volume is a shell, the effective number of uncorrelated modes is
only half the above expression.
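
The two ingredients of this argument can be illustrated numerically: the exponential distribution of single-mode power, and the mode count in a shell. The following is a minimal sketch that assumes the usual density of states $(L/2\pi)^3$ for a periodic box; the function names and the example survey numbers are hypothetical.

```python
import math

import numpy as np


def n_modes(k, dk, box_side):
    """Modes in a thin shell: density of states (L/2pi)^3 times the volume 4 pi k^2 dk."""
    return (box_side / (2.0 * math.pi)) ** 3 * 4.0 * math.pi * k ** 2 * dk


def fractional_power_error(k, dk, box_side):
    """1/sqrt(N_eff), with N_eff = N_modes/2 because modes at k and -k are not independent."""
    return 1.0 / math.sqrt(0.5 * n_modes(k, dk, box_side))


def simulate_mode_powers(n_samples, mean_power=1.0, seed=0):
    """Draw |delta_k|^2 for complex Gaussian coefficients with <|delta_k|^2> = mean_power."""
    rng = np.random.default_rng(seed)
    scale = math.sqrt(mean_power / 2.0)  # rms of the real and imaginary parts
    re = rng.normal(0.0, scale, n_samples)
    im = rng.normal(0.0, scale, n_samples)
    return re ** 2 + im ** 2


if __name__ == "__main__":
    powers = simulate_mode_powers(200_000)
    print("mean power, rms scatter:", powers.mean(), powers.std())
    # Hypothetical survey: L = 500 Mpc/h, shell at k = 0.1 h/Mpc of width 0.01 h/Mpc.
    print("fractional error:", fractional_power_error(0.1, 0.01, 500.0))
```

The simulated mean and rms scatter agree to within sampling noise, which is the property of the exponential distribution used in the text.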

Analogous results apply for an arbitrary survey selection function. In the
continuum limit, the Kronecker delta in the expression for mode correlation
would be replaced by a term proportional to a delta-function,
$\delta[\mathbf{k}_i - \mathbf{k}_j]$. Now, multiplying the infinite ideal
survey by a survey window, $\rho(\mathbf{r})$, is equivalent to convolution
in the Fourier domain, with the result that the power per mode is correlated
over $k$-space separations of order $1/D$, where $D$ is the survey depth.

Given this expression for the fractional power, it is clear that the
precision of the estimate can be manipulated by appropriate weighting of the
data: giving increased weight to the most distant galaxies increases the
effective survey volume, boosting the number of modes. This sounds too good
to be true, and of course it is: the above expression for the fractional
power error applies to the sum of true clustering power and shot noise. The
latter arises because we transform a point process. Given a set of $N$
galaxies, we would estimate Fourier coefficients via
$\delta_k = (1/N) \sum_i \exp(-i\mathbf{k}\cdot\mathbf{x}_i)$. From this,
the expectation power is

$$\langle |\delta_k|^2 \rangle = P(k) + 1/N. \tag{17}$$

The existence of an additive discreteness correction is no problem, but the
fluctuations on the shot noise hide the signal of interest. Introducing
weights boosts the shot noise, so there is an optimum choice of weight that
minimizes the uncertainty in the power after shot-noise subtraction.
Feldman, Kaiser & Peacock (1994) showed that this weight is

$$w = (1 + \bar{n} P)^{-1}, \tag{18}$$

where $\bar{n}$ is the expected galaxy number density as a function of
position in the survey.
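
The $1/N$ shot-noise term is easy to verify for unclustered points, where $P(k) = 0$. The sketch below scatters points uniformly in a periodic box, estimates $\delta_k$ for one box mode as in the estimator above, and averages $|\delta_k|^2$ over realizations; it also evaluates the weight $w = (1 + \bar{n}P)^{-1}$. The function names, box size, mode choice and realization counts are illustrative assumptions.

```python
import numpy as np


def fourier_coefficient(positions, k_vec):
    """delta_k = (1/N) sum_i exp(-i k.x_i) for an array of point positions (N, 3)."""
    phases = positions @ k_vec
    return np.exp(-1j * phases).mean()


def mean_shot_noise_power(n_gal, n_real, box_side=1.0, seed=0):
    """Average |delta_k|^2 over realizations of n_gal unclustered points."""
    rng = np.random.default_rng(seed)
    # A nonzero mode of the periodic box (hypothetical choice: third harmonic along x).
    k_vec = 2.0 * np.pi / box_side * np.array([3.0, 0.0, 0.0])
    powers = []
    for _ in range(n_real):
        pos = rng.uniform(0.0, box_side, size=(n_gal, 3))
        powers.append(abs(fourier_coefficient(pos, k_vec)) ** 2)
    return float(np.mean(powers))


def fkp_weight(nbar, power):
    """Weight w = 1 / (1 + nbar P) for local mean density nbar and assumed power P."""
    return 1.0 / (1.0 + nbar * power)


if __name__ == "__main__":
    n_gal = 100
    print("measured:", mean_shot_noise_power(n_gal, 4000), " expected 1/N:", 1.0 / n_gal)
    print("weights for nbar*P = 0, 1, 10:", [fkp_weight(x, 1.0) for x in (0.0, 1.0, 10.0)])
```

The weight tends to unity where $\bar{n}P \ll 1$ (shot-noise dominated, equal weight per galaxy) and to $1/\bar{n}P$ where $\bar{n}P \gg 1$ (equal weight per unit volume).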

Since the correlation of modes arises from the survey selection function, it
is clear that weighting the data changes the degree of correlation in $k$
space. Increasing the weight in low-density regions increases the effective
survey volume, and so shrinks the $k$-space coherence scale. However, the
coherence scale continues to shrink as distant regions of the survey are
given greater weight, whereas the noise goes through a minimum. There is
thus a trade-off between the competing desirable criteria of high $k$-space
resolution and low noise. Tegmark (1996) shows how weights may be chosen to
implement any given prejudice concerning the relative importance of these
two criteria. See also Hamilton (1997b,c) for similar arguments.

3.4. KARHUNEN-LOEVE AND ALL THAT 

Given these difficulties with correlated results, it is attractive to seek a
method where the data can be decomposed into a set of statistics that are
completely uncorrelated with each other. Such a method is provided by the
Karhunen-Loeve formalism. Vogeley & Szalay (1996) argued as follows. Define
a column vector of data $\mathbf{d}$; this can be quite abstract in nature,
and could be e.g. the numbers of galaxies in a set of cells, or a set of
Fourier components of the transformed galaxy number counts. Similarly, for
CMB studies, $\mathbf{d}$ could be $\delta T/T$ in a set of pixels, or
spherical-harmonic coefficients $a_{\ell m}$. We assume that the mean can be
identified and subtracted off, so that $\langle \mathbf{d} \rangle = 0$ in
ensemble average. The statistical properties of the data are then described
by the covariance matrix

$$C_{ij} \equiv \langle d_i d_j^* \rangle \tag{19}$$

(normally the data will be real, but it is convenient to keep things general
and include the complex conjugate).

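Suppose we seek to expand the data vector in terms of a set of new
orthonormal vectors:

$$\mathbf{d} = \sum_i a_i\, \boldsymbol{\psi}_i; \qquad \boldsymbol{\psi}_i^* \cdot \boldsymbol{\psi}_j = \delta_{ij}. \tag{20}$$

The expansion coefficients are extracted in the usual way:
$a_j = \mathbf{d} \cdot \boldsymbol{\psi}_j^*$. Now require that these
coefficients be statistically uncorrelated,
$\langle a_i a_j^* \rangle = \lambda_i \delta_{ij}$ (no sum on $i$). This
gives

$$\boldsymbol{\psi}_i^* \cdot \langle \mathbf{d}\,\mathbf{d}^* \rangle \cdot \boldsymbol{\psi}_j = \lambda_i \delta_{ij}, \tag{21}$$

where the dyadic $\langle \mathbf{d}\,\mathbf{d}^* \rangle$ is $\mathbf{C}$,
the correlation matrix of the data vector:
$\langle \mathbf{d}\,\mathbf{d}^* \rangle_{ij} \equiv \langle d_i d_j^* \rangle$.
Now, the effect of operating this matrix on one of the
$\boldsymbol{\psi}_i$ must be expandable in terms of the complete set, which
shows that the $\boldsymbol{\psi}_j$ must be the eigenvectors of the
correlation matrix:

$$\langle \mathbf{d}\,\mathbf{d}^* \rangle \cdot \boldsymbol{\psi}_j = \lambda_j \boldsymbol{\psi}_j. \tag{22}$$

Vogeley & Szalay further show that these uncorrelated modes are optimal for
representing the data: if the modes are arranged in order of decreasing
$\lambda$, and the series expansion truncated after $n$ terms, the rms
truncation error is minimized for this choice of eigenmodes. To prove this,
consider the truncation error

$$\boldsymbol{\epsilon} = \mathbf{d} - \sum_{i=1}^{n} a_i \boldsymbol{\psi}_i = \sum_{i=n+1}^{\infty} a_i \boldsymbol{\psi}_i. \tag{23}$$

The square of this is

$$\langle \epsilon^2 \rangle = \sum_{i=n+1}^{\infty} \langle |a_i|^2 \rangle, \tag{24}$$

where $\langle |a_i|^2 \rangle = \boldsymbol{\psi}_i^* \cdot \mathbf{C} \cdot \boldsymbol{\psi}_i$,
as before. We want to minimize $\langle \epsilon^2 \rangle$ by varying the
$\boldsymbol{\psi}_i$, but we need to do this in a way that preserves
normalization. This is achieved by introducing a Lagrange multiplier, and
minimizing

$$\sum_i \boldsymbol{\psi}_i^* \cdot \mathbf{C} \cdot \boldsymbol{\psi}_i + \lambda\, (1 - \boldsymbol{\psi}_i^* \cdot \boldsymbol{\psi}_i). \tag{25}$$

This is easily solved if we consider the more general problem where
$\boldsymbol{\psi}_i^*$ and $\boldsymbol{\psi}_i$ are independent vectors:

$$\mathbf{C} \cdot \boldsymbol{\psi}_i = \lambda \boldsymbol{\psi}_i. \tag{26}$$

In short, the eigenvectors of $\mathbf{C}$ are optimal in a least-squares
sense for expanding the data. The process of truncating the expansion is a
form of lossy data compression, since the size of the data vector can be
greatly reduced without significantly affecting the fidelity of the
resulting representation of the universe.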
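
As a concrete illustration of the truncation argument, the following minimal sketch diagonalizes a covariance matrix with NumPy, projects data onto the leading modes, and compares the mean-square truncation error with the sum of the discarded eigenvalues. The random test covariance, the function names and the sample sizes are hypothetical and serve only to exercise the algebra for real data.

```python
import numpy as np


def kl_modes(cov):
    """Eigen-decompose a Hermitian covariance; eigenvalues descending, modes as columns."""
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]
    return eigvals[order], eigvecs[:, order]


def truncate(data, modes, n_keep):
    """Project data (n_dim, n_samples) onto the first n_keep modes and reconstruct."""
    kept = modes[:, :n_keep]
    coeffs = kept.conj().T @ data  # a_i = psi_i^* . d
    return kept @ coeffs


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    b = rng.normal(size=(6, 6))
    cov = b @ b.T  # a hypothetical positive-definite covariance
    lam, psi = kl_modes(cov)
    data = rng.multivariate_normal(np.zeros(6), cov, size=20000).T
    err = data - truncate(data, psi, 3)
    print("measured <eps^2>:", np.mean(np.sum(err ** 2, axis=0)))
    print("sum of discarded eigenvalues:", lam[3:].sum())
```

Keeping the modes with the largest eigenvalues discards the smallest possible variance, which is the least-squares optimality stated above.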

The process of diagonalizing the covariance matrix of a set of data also
goes by the more familiar name of principal components analysis, so what is
the difference between the KL approach and PCA? In the above discussion,
they are identical, but the idea of choosing an optimal eigenbasis is more
general than PCA. Consider the case where the covariance matrix can be
decomposed into a 'signal' and a 'noise' term:

$$\mathbf{C} = \mathbf{S} + \mathbf{N}, \tag{27}$$

where $\mathbf{S}$ depends on cosmological parameters that we might wish to
estimate, whereas $\mathbf{N}$ is some fixed property of the experiment
under consideration. In the simplest imaginable case, $\mathbf{N}$ might be
a diagonal matrix, so PCA diagonalizes both $\mathbf{S}$ and $\mathbf{N}$.
In this case, ranking the PCA modes by eigenvalue would correspond to
ordering the modes according to signal-to-noise ratio. Data compression by
truncating the mode expansion then does the sensible thing: it rejects all
modes of low signal-to-noise ratio.

However, in general these matrices will not commute, and there will not be a
single set of eigenfunctions that are common to the $\mathbf{C}$ and
$\mathbf{N}$ matrices. Normally, this would be taken to mean that it is
impossible to find a set of coordinates in which both are diagonal. This
conclusion can however be evaded, as follows. When considering the effect of
coordinate transformations on vectors and matrices, we are normally forced
to consider only rotation-like transformations that preserve the norm of a
vector (e.g. in quantum mechanics, so that states stay normalized). Thus, we
write $\mathbf{d}' = \mathbf{R} \cdot \mathbf{d}$, where $\mathbf{R}$ is
unitary, so that $\mathbf{R} \cdot \mathbf{R}^\dagger = \mathbf{I}$. If
$\mathbf{R}$ is chosen so that its rows are the (complex-conjugated)
eigenvectors of $\mathbf{N}$, then the transformed noise matrix,
$\mathbf{R} \cdot \mathbf{N} \cdot \mathbf{R}^\dagger$, is diagonal.
Nevertheless, if the transformed $\mathbf{S}$ is not diagonal, the two will
not commute. This apparently insuperable problem can be solved by using the
fact that the data vectors are entirely abstract at this stage. There is
therefore no reason not to consider the further transformation of scaling
the data, so that $\mathbf{N}$ becomes proportional to the identity matrix.
This means that the transformation is no longer unitary, but there is no
physical reason to object to a change in the normalization of the data
vectors.

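Suppose we therefore make a further transformation

$$\mathbf{d}'' = \mathbf{W} \cdot \mathbf{d}'. \tag{28}$$

The matrix $\mathbf{W}$ is related to the rotated noise matrix:

$$\mathbf{N}' = {\rm diag}(n_1, n_2, \ldots) \quad\Rightarrow\quad \mathbf{W} = {\rm diag}(1/\sqrt{n_1}, 1/\sqrt{n_2}, \ldots). \tag{29}$$

This transformation is termed prewhitening by Vogeley & Szalay (1996), since
it converts the noise matrix to white noise, in which each pixel has a unit
noise that is uncorrelated with other pixels. The effect of this
transformation on the full covariance matrix is

$$C''_{ij} \equiv \langle d''_i\, d''^*_j \rangle \quad\Rightarrow\quad \mathbf{C}'' = (\mathbf{W} \cdot \mathbf{R}) \cdot \mathbf{C} \cdot (\mathbf{W} \cdot \mathbf{R})^\dagger. \tag{30}$$

After this transformation, the noise and signal matrices certainly do
commute, and the optimal modes for expanding the new data are once again the
PCA eigenmodes in the new coordinates:

$$\mathbf{C}'' \cdot \boldsymbol{\psi}''_i = \lambda\, \boldsymbol{\psi}''_i. \tag{31}$$

These eigenmodes must be expressible in terms of some modes in the original
coordinates, $\mathbf{e}_i$:

$$\boldsymbol{\psi}''_i = (\mathbf{W} \cdot \mathbf{R}) \cdot \mathbf{e}_i. \tag{32}$$

In these terms, the eigenproblem is

$$(\mathbf{W} \cdot \mathbf{R}) \cdot \mathbf{C} \cdot (\mathbf{W} \cdot \mathbf{R})^\dagger \cdot (\mathbf{W} \cdot \mathbf{R}) \cdot \mathbf{e}_i = \lambda\, (\mathbf{W} \cdot \mathbf{R}) \cdot \mathbf{e}_i. \tag{33}$$

This can be simplified using
$\mathbf{W}^\dagger \cdot \mathbf{W} = \mathbf{N}'^{-1}$ and
$\mathbf{N}'^{-1} = \mathbf{R} \cdot \mathbf{N}^{-1} \cdot \mathbf{R}^\dagger$,
to give

$$\mathbf{C} \cdot \mathbf{N}^{-1} \cdot \mathbf{e}_i = \lambda\, \mathbf{e}_i, \tag{34}$$

so the required modes are eigenmodes of $\mathbf{C} \cdot \mathbf{N}^{-1}$.
However, care is required when considering the orthonormality of the
$\mathbf{e}_i$:
$\boldsymbol{\psi}''^*_i \cdot \boldsymbol{\psi}''_j = \mathbf{e}_i^* \cdot \mathbf{N}^{-1} \cdot \mathbf{e}_j$,
so the $\mathbf{e}_i$ are not orthonormal. If we write
$\mathbf{d} = \sum_i a_i \mathbf{e}_i$, then

$$a_i = (\mathbf{N}^{-1} \cdot \mathbf{e}_i)^* \cdot \mathbf{d} \equiv \boldsymbol{\psi}_i^* \cdot \mathbf{d}. \tag{35}$$

Thus, the modes used to extract the compressed data by dot product satisfy
$\mathbf{C} \cdot \boldsymbol{\psi} = \lambda \mathbf{N} \cdot \boldsymbol{\psi}$,
or finally

$$\mathbf{S} \cdot \boldsymbol{\psi} = \lambda\, \mathbf{N} \cdot \boldsymbol{\psi}, \tag{36}$$

given a redefinition of $\lambda$. The optimal modes are thus eigenmodes of
$\mathbf{N}^{-1} \cdot \mathbf{S}$,
hence the name signal-to-noise eigenmodes (Bond 1995; Bunn 1996).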
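
The prewhitening construction can be followed step by step in a few lines of NumPy. This is a minimal sketch for real symmetric matrices, assuming a positive-definite noise matrix; the function name and the random test matrices are hypothetical. It rotates to the noise eigenbasis, rescales to unit noise, diagonalizes the prewhitened signal matrix, and maps the modes back to the original coordinates.

```python
import numpy as np


def signal_to_noise_modes(signal, noise):
    """Solve S psi = lambda N psi by prewhitening (real symmetric S, positive-definite N)."""
    n_vals, n_vecs = np.linalg.eigh(noise)                 # N = R^T diag(n) R, with R = n_vecs^T
    whiten = np.diag(1.0 / np.sqrt(n_vals)) @ n_vecs.T     # the combined matrix W . R
    s_white = whiten @ signal @ whiten.T                   # signal matrix in prewhitened coordinates
    lam, vecs = np.linalg.eigh(s_white)
    order = np.argsort(lam)[::-1]                          # rank modes by signal-to-noise
    lam, vecs = lam[order], vecs[:, order]
    psi = whiten.T @ vecs                                  # modes that extract a_i = psi_i . d
    return lam, psi


if __name__ == "__main__":
    rng = np.random.default_rng(2)
    a = rng.normal(size=(5, 5))
    b = rng.normal(size=(5, 5))
    s = a @ a.T                      # hypothetical signal covariance
    n = b @ b.T + 5.0 * np.eye(5)    # hypothetical noise covariance
    lam, psi = signal_to_noise_modes(s, n)
    print("signal-to-noise eigenvalues:", lam)
    print("max residual of S psi - lambda N psi:",
          np.abs(s @ psi - n @ psi @ np.diag(lam)).max())
```

The returned modes are orthonormal with respect to the noise matrix rather than the identity, which mirrors the caveat about orthonormality made above.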

It is interesting to appreciate that the set of KL modes just discussed is
also the 'best' set of modes to choose from a completely different point of
view: they are the modes that are optimal for estimation of a parameter via
maximum likelihood. Suppose we write the compressed data vector,
$\mathbf{x}$, in terms of a non-square matrix $\mathbf{A}$ (whose rows are
the basis vectors $\boldsymbol{\psi}_i^*$):

$$\mathbf{x} = \mathbf{A} \cdot \mathbf{d}. \tag{37}$$

The transformed covariance matrix is

$$\mathbf{D} \equiv \langle \mathbf{x}\, \mathbf{x}^\dagger \rangle = \mathbf{A} \cdot \mathbf{C} \cdot \mathbf{A}^\dagger. \tag{38}$$

For the case where the original data obeyed Gaussian statistics, this is
true for the compressed data also, so the likelihood is

$$-2 \ln \mathcal{L} = \ln \det \mathbf{D} + \mathbf{x}^* \cdot \mathbf{D}^{-1} \cdot \mathbf{x} + {\rm constant}. \tag{39}$$

The normal variance on some parameter $p$ (on which the covariance matrix
depends) is

$$\frac{1}{\sigma_p^2} = \frac{1}{2}\, \frac{d^2\, [-2 \ln \mathcal{L}]}{dp^2}. \tag{40}$$

Without data, we don't know this, so it is common to use the expectation
value of the rhs as an estimate (recently, there has been a tendency to dub
this the 'Fisher matrix').

We desire to optimize $\sigma_p$ by an appropriate choice of
data-compression vectors, $\boldsymbol{\psi}_i$. By writing $\sigma_p$ in
terms of $\mathbf{A}$, $\mathbf{C}$ and $\mathbf{d}$, it may eventually be
shown that the desired optimal modes satisfy

$$\left(\frac{d}{dp} \mathbf{C}\right) \cdot \boldsymbol{\psi} = \lambda\, \mathbf{C} \cdot \boldsymbol{\psi}. \tag{41}$$

For the case where the parameter of interest is the cosmological power, the
matrix on the lhs is just proportional to $\mathbf{S}$, so we have to solve
the eigenproblem

$$\mathbf{S} \cdot \boldsymbol{\psi} = \lambda\, \mathbf{C} \cdot \boldsymbol{\psi}. \tag{42}$$

With a redefinition of $\lambda$, this becomes

$$\mathbf{S} \cdot \boldsymbol{\psi} = \lambda\, \mathbf{N} \cdot \boldsymbol{\psi}. \tag{43}$$

The optimal modes for parameter estimation in the linear case are thus
identical to the PCA modes of the prewhitened data discussed above. The more
general expression was given by Tegmark, Taylor & Heavens (1997), and it is
only in this case, where the covariance matrix is not necessarily linear in
the parameter of interest, that the KL method actually differs from PCA.

The reason for going to all this trouble is that the likelihood can now be
evaluated much more rapidly, using the compressed data. This allows
extensive model searches over large parameter spaces that would be
unfeasible with the original data (since inversion of an $N \times N$
covariance matrix takes a time proportional to $N^3$). Note however that the
price paid for this efficiency is that a different set of modes needs to be
chosen depending on the model of interest, and that these modes will not in
general be optimal for expanding the dataset itself. Nevertheless, it may be
expected that application of these methods will inevitably grow as datasets
increase in size. Present applications mainly prove that the techniques
work: see Matsubara, Szalay & Landy (1999) for application to the LCRS, or
Padmanabhan, Tegmark & Hamilton (1999) for the UZC survey. The next
generation of experiments will probably be forced to resort to data
compression of this sort, rather than using it as an elegant alternative
method of analysis.

4. Redshift-space effects

Peculiar velocity fields are responsible for the distortion of the
clustering pattern in redshift space, as first clearly articulated by Kaiser
(1987). For a survey that subtends a small angle (i.e. in the
distant-observer approximation), a good approximation to the anisotropic
redshift-space Fourier spectrum is given by the Kaiser function together
with a damping term from nonlinear effects:

$$\delta_k^s = \delta_k^r\, (1 + \beta \mu^2)\, D(k \sigma \mu), \tag{44}$$

where $\beta = \Omega_m^{0.6}/b$, $b$ being the linear bias parameter of the
galaxies under study, and $\mu = \hat{\mathbf{k}} \cdot \hat{\mathbf{r}}$.
For an exponential distribution of relative small-scale peculiar velocities
(as seen empirically), the damping function is
$D(y) \simeq (1 + y^2/2)^{-1/2}$, and $\sigma \simeq 400\,{\rm km\,s^{-1}}$
is a reasonable estimate for the pairwise velocity dispersion of galaxies
(e.g. Ballinger, Peacock & Heavens 1996).
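
A short numerical sketch shows how the Kaiser boost and the damping compete. It evaluates the ratio of redshift-space to real-space power implied by the expression above, $[(1 + \beta\mu^2)\, D(k\sigma\mu)]^2$, and its average over $\mu$. Two assumptions are made here that are not spelled out in the text: $k$ is taken to be in $h\,{\rm Mpc}^{-1}$, and $\sigma$ is converted to a length by dividing by $H_0 = 100\,h\,{\rm km\,s^{-1}\,Mpc^{-1}}$. The function names and example values are illustrative.

```python
import numpy as np


def damping(y):
    """D(y) = (1 + y^2/2)^(-1/2), appropriate for exponential pairwise velocities."""
    return 1.0 / np.sqrt(1.0 + 0.5 * np.asarray(y, dtype=float) ** 2)


def redshift_space_boost(k, mu, beta, sigma_kms=400.0, h0=100.0):
    """Ratio P_s/P_r = [(1 + beta mu^2) D(k sigma mu)]^2.

    Assumes k in h/Mpc; sigma is converted to Mpc/h by dividing by H0 = 100 h km/s/Mpc.
    """
    sigma_len = sigma_kms / h0
    mu = np.asarray(mu, dtype=float)
    return ((1.0 + beta * mu ** 2) * damping(k * sigma_len * mu)) ** 2


def monopole_boost(k, beta, sigma_kms=400.0, h0=100.0, n_nodes=32):
    """Angle average of the boost over mu in [-1, 1], by Gauss-Legendre quadrature."""
    mu, weights = np.polynomial.legendre.leggauss(n_nodes)
    return 0.5 * np.sum(weights * redshift_space_boost(k, mu, beta, sigma_kms, h0))


if __name__ == "__main__":
    beta = 0.5  # hypothetical value, for illustration only
    for k in (0.01, 0.1, 0.5, 1.0):
        print(f"k = {k:4.2f} h/Mpc: monopole boost = {monopole_boost(k, beta):.3f}")
```

In the undamped limit the angle average reduces to the familiar $1 + 2\beta/3 + \beta^2/5$, while at large $k$ the damping term drives the ratio below unity.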

In principle, this distortion should be a robust way to determine
$\Omega_m$ (or at least $\beta$). In practice, the effect has not been easy
to see with past datasets. This is mainly a question of depth: a large
survey is needed in order to beat down the shot noise, but this tends to
favour bright spectroscopic limits. This limits the result both because
relatively few modes in the linear regime are sampled, and also because
local survey volumes will tend to violate the small-angle approximation.
Strauss & Willick (1995) and Hamilton (1997a) review the practical
application of redshift-space distortions. In the next section, preliminary
results are presented from the 2dF redshift survey, which shows the
distortion effect clearly for the first time.

5. The state of the art in LSS 
5.1. THE APM SURVEY 

In the past few years, much attention has been attracted by the estimate of
the galaxy power spectrum from the APM survey (Baugh & Efstathiou 1993,
1994; Maddox et al. 1996). The APM result was generated from a catalogue of
$\sim 10^6$ galaxies derived from UK Schmidt Telescope photographic plates
scanned with the Cambridge Automatic Plate Measuring machine; because it is
based on a deprojection of angular clustering, it is immune to the
complicating effects of redshift-space distortions. The difficulty, of
course, is in ensuring that any low-level systematics from e.g. spatial
variations in magnitude zero point are sufficiently well controlled that
they do not mask the cosmological signal, which is of order
$w(\theta) \lesssim 0.01$ at separations of a few degrees.

The best evidence that the APM survey has the desired uniformity is the
scaling test, where the correlations in fainter magnitude slices are
expected to move to smaller scales and be reduced in amplitude. If we
increase the depth of the survey by some factor $D$, the new angular
correlation function will be

$$w'(\theta) = \frac{1}{D}\, w(D\theta). \tag{45}$$
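
The scaling relation is simple to apply in practice. The sketch below assumes a purely illustrative power-law form $w(\theta) = A\,\theta^{-0.7}$ (the amplitude and slope are hypothetical, not APM measurements) and shows the expected reduction in amplitude when the depth is doubled.

```python
import numpy as np


def power_law_w(theta, amplitude=1.0, delta=0.7):
    """Hypothetical angular correlation function w = A theta^(-delta)."""
    return amplitude * np.asarray(theta, dtype=float) ** (-delta)


def scaled_w(w_func, theta, depth_factor):
    """Predicted w'(theta) = w(D theta) / D for a survey deeper by a factor D."""
    return w_func(depth_factor * np.asarray(theta, dtype=float)) / depth_factor


if __name__ == "__main__":
    theta = np.array([0.1, 1.0, 5.0])  # angles in arbitrary units
    ratio = scaled_w(power_law_w, theta, 2.0) / power_law_w(theta)
    print("w'/w for D = 2:", ratio)
```

For a pure power law the shape is unchanged and only the amplitude drops, by $D^{-(1+\delta)}$; any feature in $w(\theta)$ at angle $\theta_b$ would instead move to $\theta_b/D$, which is what makes the test sensitive to plate-scale systematics.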



The APM survey passes this test well; once the overall redshift distribution
is known, it is possible to obtain the spatial power spectrum by inverting
a convolution integral:

$$w(\theta) = \int_0^\infty y^4 \phi^2\, dy \int_0^\infty \pi\, \Delta^2(k)\, J_0(k y \theta)\, \frac{dk}{k^2} \tag{46}$$

(where zero spatial curvature is assumed). Here, $\phi(y)$ is the comoving
density at comoving distance $y$, normalized so that
$\int y^2 \phi(y)\, dy = 1$.

This integral was inverted numerically by Baugh & Efstathiou (1993), 
and gives an impressively accurate determination of the power spectrum. 
The error estimates are derived empirically from the scatter between inde- 
pendent regions of the sky, and so should be realistic. If there are no unde- 
tected systematics, these error bars say that the power is very accurately 
determined. The APM result has been investigated in detail by a number 
of authors (e.g. Gaztañaga & Baugh 1998; Eisenstein & Zaldarriaga 1999)
and found to be robust; this has significant implications if true. 

5.2. PAST REDSHIFT SURVEYS 

Because of the sheer number of galaxies, plus the large volume surveyed, the
APM survey outperforms redshift surveys of the past, at least for the
purpose of determining the power spectrum. The largest surveys of recent
years (CfA: Huchra et al. 1990; LCRS: Shectman et al. 1996; PSCz: Saunders
et al. 1999) contain of order $10^4$ galaxy redshifts, and their statistical
errors are considerably larger than those of the APM. On the other hand, it
is of great importance to compare the results of deprojection with
clustering measured directly in 3D.

This comparison was carried out by Peacock Sz Dodds (1994; PD94). 
The exercise is not straightforward, because the 3D results are affected 
by redshift-space distortions; also, different galaxy tracers can be biased to 
different extents. The approach taken was to use each dataset to reconstruct 
an estimate of the linear spectrum, allowing the relative bias factors to float 
in order to make these estimates agree as well as possible (figure 5). To 
within a scatter of perhaps a factor 1.5 in power, the results were consistent 
with a Γ ≃ 0.25 CDM model. Even though the subsequent sections will
discuss some possible disagreements with the CDM models at a higher level 
of precision, the general existence of CDM-like curvature in the spectrum 
is likely to be an important clue to the nature of the dark matter. 

Figure 5. The PD94 compilation of power-spectrum measurements. The upper panel
shows raw power measurements; the lower shows these data corrected for relative bias, 
nonlinear effects, and redshift-space effects. 



5.3. THE 2DF SURVEY 



The proper resolution of many of the observational questions regarding
the large-scale distribution of galaxies requires new generations of redshift
surveys that push beyond the N = 10^5 barrier. Two groups are pursuing
this goal. The Sloan survey (e.g. Margon 1999) is using a dedicated
2.5-m telescope to measure redshifts for approximately 700,000 galaxies to
r = 18.2 in the North Galactic Cap. The 2dF survey (e.g. Colless 1999)
is using a fraction of the time on the 3.9-m Anglo-Australian Telescope
plus Two-Degree Field spectrograph to measure 250,000 galaxies from the
APM survey to b_J = 19.45 in the South Galactic Cap. At the time of
writing, the Sloan spectroscopic survey has yet to commence. However,
the 2dF project has measured 77,000 redshifts, and some preliminary
clustering results are given below. For more details of the survey,
particularly the team members whose hard work has made all this possible, see
http://www.mso.anu.edu.au/2dFGRS/.

Figure 6. A 4-degree thick slice of the Southern strip of the 2dF redshift survey. This
restricted region alone contains 16,419 galaxies.

One of the advantages of 2dF is that it is a fully sampled survey, so 
that the space density out to the depth imposed by the magnitude limit 
(median z = 0.12) is as high as nature allows: apart from a tail of low surface 
brightness galaxies (inevitably omitted from any spectroscopic survey) , the 
2dF survey measures all the galaxies that exist over a cosmologically
representative volume. It is the first to achieve this goal. The fidelity of
the resulting map of the galaxy distribution can be seen in figure 6, which
shows a small subset of the data: a slice of thickness 4 degrees, centred at
declination −27°.

An issue with using the 2dF data in their current form is that the sky 
has to be divided into circular 'tiles' each two degrees in diameter ('2dF' 
= 'two-degree field', within which the AAT is able to measure 400 spectra 
simultaneously; see http://www.aao.gov.au/2df/ for details of the
instrument). The tiles are positioned adaptively, so that larger overlaps occur in
regions of high galaxy density. In this way, it is possible to place a fibre on
> 95% of all galaxies. However, while the survey is in progress, there exist
parts of the sky where the overlapping tiles have not yet been observed,
and so the effective sampling fraction is only ~ 50%. These effects can be
allowed for in two different ways. In clustering analyses, we compare the 
counts of pairs (or n-tuplets) of galaxies in the data to the corresponding 
counts involving an unclustered random catalogue. The effects of variable 
sampling can therefore be dealt with either by making the density of ran- 
dom points fluctuate according to the sampling, or by weighting observed 
galaxies by the reciprocal of the sampling factor for the zone in which they 
lie. The former approach is better from the point of view of shot noise, 
but the latter may be safer if there is any suspicion that the sampling fluc- 
tuations are correlated with real structure on the sky. In practice, both 
strategies give identical answers for the results below. 
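
The two ways of allowing for variable sampling can be expressed with a single weighted pair counter. The sketch below is an illustration under stated assumptions, not the 2dF analysis code: it uses the simple DD/RR estimator, brute-force pair counting, and hypothetical function names. Weighting galaxies by the reciprocal of the sampling fraction corresponds to passing those values as `data_w`; giving each random point a weight equal to the local sampling fraction is used here as a stand-in for thinning the random catalogue.

```python
import math
from itertools import combinations


def weighted_pair_counts(points, weights, edges):
    """Sum w_i * w_j over distinct pairs, binned by pair separation."""
    counts = [0.0] * (len(edges) - 1)
    for (p, wp), (q, wq) in combinations(zip(points, weights), 2):
        sep = math.dist(p, q)
        for b in range(len(counts)):
            if edges[b] <= sep < edges[b + 1]:
                counts[b] += wp * wq
                break
    return counts


def pair_norm(weights):
    """Total weighted number of distinct pairs: ((sum w)^2 - sum w^2) / 2."""
    total = sum(weights)
    return (total * total - sum(w * w for w in weights)) / 2.0


def xi_natural(data, data_w, randoms, random_w, edges):
    """Simple estimator xi = (DD / RR) - 1, using normalized weighted counts."""
    dd = weighted_pair_counts(data, data_w, edges)
    rr = weighted_pair_counts(randoms, random_w, edges)
    norm_d, norm_r = pair_norm(data_w), pair_norm(random_w)
    return [
        (d / norm_d) / (r / norm_r) - 1.0 if r > 0 else float("nan")
        for d, r in zip(dd, rr)
    ]


# Tiny worked case: three points on a line, two separation bins.
points = [(0.0, 0.0), (1.0, 0.0), (3.0, 0.0)]
edges = [0.0, 1.5, 3.5]
print(weighted_pair_counts(points, [1.0, 1.0, 1.0], edges))  # unit weights
print(weighted_pair_counts(points, [2.0, 1.0, 1.0], edges))  # first point upweighted
```

Upweighting galaxies amplifies their shot noise, which is why modulating the randoms is the better choice when the sampling map is trusted.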

At the two-point level, the most direct quantity to compute is the 
redshift-space correlation function. This is an anisotropic function of the 
orientation of a galaxy pair, owing to peculiar velocities. We therefore
evaluate ξ as a function of 2D separation in terms of coordinates both parallel
and perpendicular to the line of sight. If the comoving radii of two galaxies
are y_1 and y_2 and their total separation is r, then we define coordinates

$$\pi = |y_1 - y_2|; \qquad \sigma = \sqrt{r^2 - \pi^2}. \qquad (47)$$

The correlation function measured in these coordinates is shown in figure
7. In evaluating ξ(σ, π), the optimal radial weight discussed above has been
applied, so that the noise at large r should be representative of true cosmic
scatter.
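
As a concrete illustration of these coordinates, the following sketch computes the radial and transverse separations of a galaxy pair. It assumes that positions are supplied as comoving Cartesian vectors with the observer at the origin; the function name is hypothetical.

```python
import math


def sigma_pi(pos1, pos2):
    """Return (sigma, pi) for two comoving position vectors (observer at origin).

    pi is the difference of the comoving radii and sigma follows from the
    total separation r via sigma = sqrt(r^2 - pi^2).
    """
    y1 = math.hypot(*pos1)
    y2 = math.hypot(*pos2)
    r = math.dist(pos1, pos2)
    pi_sep = abs(y1 - y2)
    # The triangle inequality guarantees r >= pi; max() guards against rounding.
    sigma = math.sqrt(max(r * r - pi_sep * pi_sep, 0.0))
    return sigma, pi_sep


# A purely radial pair and a purely transverse pair.
print(sigma_pi((0.0, 0.0, 100.0), (0.0, 0.0, 110.0)))
print(sigma_pi((3.0, 0.0, 4.0), (0.0, 0.0, 5.0)))
```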

The correlation-function results display very clearly the two signatures 
of redshift-space distortions discussed above. The fingers of God from small- 
scale random velocities are very clear, as indeed has been the case from the 
first redshift surveys (e.g. Davis & Peebles 1983). However, this is arguably 
the first time that the large-scale flattening from coherent infall has been 
really obvious in the data. 

Figure 7. The redshift-space correlation function from the 2dF data, ξ(σ, π), with a bin
size of 0.6 Mpc. σ is the pair separation transverse to the line of sight; π is the radial
separation. This plot clearly displays redshift distortions, with 'fingers of God' at small
scales and the coherent Kaiser squashing at large σ. The distortions are quantified via
the quadrupole-to-monopole ratio of ξ as a function of radius in the second panel. The
contours are round at r = 7 Mpc, but flatten progressively thereafter.

A good way to quantify the flattening is to decompose the clustering as a
function of angle into Legendre polynomials:

$$\xi_\ell(r) = \frac{2\ell + 1}{2} \int_{-1}^{1} \xi(\sigma = r\sin\theta,\ \pi = r\cos\theta)\, P_\ell(\cos\theta)\; d\cos\theta. \qquad (48)$$

The quadrupole-to-monopole ratio should be a clear indicator of coherent
infall. In linear theory, it is given by

$$\frac{\xi_2}{\xi_0} = f(n)\, \frac{4\beta/3 + 4\beta^2/7}{1 + 2\beta/3 + \beta^2/5}, \qquad (49)$$

where f(n) = (3 + n)/n (Hamilton 1992). On small and intermediate scales,
the effective spectral index is negative, so the quadrupole-to-monopole ratio
should be negative, as observed.
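
The multipole decomposition and the linear-theory ratio can be sketched as follows. This is an illustration only: the angular integral is done with a plain Simpson rule, only the ℓ = 0, 2, 4 polynomials are coded, and the linear-theory expression is assumed to be the Hamilton (1992) form with f(n) = (3 + n)/n. The function names are hypothetical.

```python
import math


def legendre(ell, mu):
    """Legendre polynomials P_0, P_2 and P_4."""
    if ell == 0:
        return 1.0
    if ell == 2:
        return 0.5 * (3.0 * mu * mu - 1.0)
    if ell == 4:
        return (35.0 * mu ** 4 - 30.0 * mu * mu + 3.0) / 8.0
    raise ValueError("only ell = 0, 2, 4 are implemented")


def multipole(xi_sigma_pi, r, ell, n=2000):
    """xi_ell(r) = (2 ell + 1)/2 * integral of xi(sigma, pi) P_ell(mu) dmu.

    Here mu = cos(theta), sigma = r sin(theta) and pi = r |cos(theta)|;
    the integral over -1 <= mu <= 1 uses Simpson's rule with n intervals.
    """
    h = 2.0 / n
    total = 0.0
    for i in range(n + 1):
        mu = -1.0 + i * h
        sigma = r * math.sqrt(max(1.0 - mu * mu, 0.0))
        pi_sep = r * abs(mu)
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * xi_sigma_pi(sigma, pi_sep) * legendre(ell, mu)
    return 0.5 * (2 * ell + 1) * total * h / 3.0


def linear_quadrupole_ratio(beta, n_eff):
    """Linear-theory xi_2 / xi_0 for distortion parameter beta and index n."""
    f_n = (3.0 + n_eff) / n_eff
    return f_n * (4.0 * beta / 3.0 + 4.0 * beta ** 2 / 7.0) / (
        1.0 + 2.0 * beta / 3.0 + beta ** 2 / 5.0
    )


# beta = 0.5 with an effective index n = -1 gives a negative ratio.
print(linear_quadrupole_ratio(0.5, -1.0))
```

Applied to a measured ξ(σ, π) grid, `multipole` would need an interpolating function in place of the analytic callable assumed here.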

However, it is clear that the results on the largest scales are still signif- 
icantly affected by finger-of-God smearing. The best way to interpret the 
observed effects is to calculate the same quantities for a model. To achieve 
this, we use the observed APM 3D power spectrum, plus the distortion 
model discussed above. This gives the plots shown in figure 8. The free 
parameter is β, and this is set at a value of 0.5, approximately consistent
with other arguments for a universe with Ω = 0.3 and little large-scale bias
(e.g. Peacock 1997). Although a quantitative comparison has not yet been
carried out, it is clear that this plot closely resembles the observed data.

Figure 8. The redshift-space correlation function predicted from the real-space APM
power spectrum, assuming the model of Ballinger, Peacock & Heavens (1996), with
β = 0.5.



By the end of 2001, the size of the 2dF survey should have expanded
by a factor 3, increasing the pair counts tenfold. It should then be
possible to trace the correlations well beyond the present limit, and follow the
redshift-space distortion well into the linear regime. However, the biggest 
advantage of a survey of this size and uniformity is the ability to subdi- 
vide it. All analyses to date have lumped together very different kinds of 
galaxies, whereas we know from morphological segregation that different 
classes of galaxy have spatial distributions that differ from each other. The 
homogeneous 2dF data allow classification into different galaxy types (rep- 
resenting, physically, a sequence of star-formation rates), from the spectra 
alone (Folkes et al. 1999). It will be a critical test to see if the distortion sig- 
nature can be picked up in each type individually. Although the large-scale 
behaviour of each galaxy type will probably be quite similar, differences 
in the clustering properties will inevitably arise on smaller scales, giving 
important information about the sequence of galaxy formation. 

6. Small-scale clustering 
6.1. HISTORY 

One of the earliest models used to interpret the galaxy correlation
function considered a density field composed of randomly-placed
independent clumps with some universal density profile (Neyman, Scott &
Shane 1953; Peebles 1974). Since the clumps are placed at random, the
only correlations arise from points in the same clump. The correlations are
easily deduced by using statistical isotropy: calculate the excess number of
pairs separated by a distance r in the z direction (chosen as some arbitrary
polar axis in a spherically-symmetric clump). For power-law clumps, with
ρ = n B r^(-ε), truncated at r = R, this model gives ξ ∝ r^(3-2ε) in the limit
r ≪ R, provided 3/2 < ε < 3. Values ε > 3 are unphysical, and require a
small-scale cutoff to the profile. There is no such objection to ε < 3/2, and
the expression for ξ tends to a constant for small r in this case (see Yano
& Gouda 1999).
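
The relation between profile index and correlation slope is simple enough to tabulate directly. The sketch below assumes the small-separation scaling ξ ∝ r^(3-2ε) quoted above and its validity range; the function names are hypothetical.

```python
def xi_slope(eps):
    """Logarithmic slope of xi for clumps with density proportional to r**(-eps)."""
    if not 1.5 < eps < 3.0:
        raise ValueError("the power-law result requires 3/2 < eps < 3")
    return 3.0 - 2.0 * eps


def profile_index_for_slope(slope):
    """Invert xi proportional to r**slope for the required profile index."""
    return (3.0 - slope) / 2.0


# Isothermal clumps (eps = 2) give xi proportional to 1/r.
print(xi_slope(2.0))
# The profile index needed for a correlation slope of -1.8.
print(profile_index_for_slope(-1.8))
```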

A long-standing problem for this model is that the correlation function
in this case is much flatter than is observed for galaxies: ξ ∝ r^(-1.8) is the
canonical slope, requiring ε = 2.4. The first reaction may be to say that
the model is incredibly naive by comparison with our sophisticated present 
understanding of the nonlinear evolution of CDM density fields. However, 
as will be shown below, it may after all contain more than a grain of truth. 

6.2. THE CDM CLUSTERING PROBLEMS 

A number of authors have pointed out that the detailed spectral shape
inferred from galaxy data appears to be inconsistent with that of nonlinear
evolution from CDM initial conditions (e.g. Efstathiou, Sutherland &
Maddox 1990; Klypin, Primack & Holtzman 1996; Peacock 1997). Perhaps
the most detailed work was carried out by the VIRGO consortium, who
ran N = 256^3 simulations of a number of CDM models (Jenkins
et al. 1998). Their results are shown in figure 9, which gives the nonlinear
power spectrum at various times (cluster normalization is chosen for z = 0)
and contrasts this with the APM data. The lower small panels are the
scale-dependent bias that would be required if the model did in fact describe
the real universe, defined as

$$b(k) = \left[ \frac{\Delta^2_{\rm gals}(k)}{\Delta^2_{\rm mass}(k)} \right]^{1/2}. \qquad (50)$$

In all cases, the required bias is non-monotonic; it rises at k > 5 h Mpc^-1,
but also displays a bump around k ~ 0.1 h Mpc^-1. If real, this feature seems
impossible to understand as a genuine feature of the mass power spectrum;
certainly, it is not at a scale where the effects of even a large baryon fraction
would be expected to act (Eisenstein et al. 1998; Meiksin, White & Peacock
1999).

7. Bias

The conclusions from the above discussion are either that the physics of 
dark matter and structure formation are more complex than in CDM mod- 
els, or that the relation between galaxies and the overall matter distribution 
is sufficiently complicated that the effective bias is not a simple slowly- 
varying monotonic function of position. 

7.1. SIMPLE BIAS MODELS

The simplest assumption is that all the complicated physical effects leading
to galaxy formation depend in a causal (but nonlinear) way on the local
mass density, so that we write

$$\rho_{\rm light} = f(\rho_{\rm mass}). \qquad (51)$$

Coles (1993) showed that, under rather general assumptions, this equation
would lead to an effective bias that was a monotonic function of scale. This
issue was investigated in some detail by Mann, Peacock & Heavens (1998),
who verified Coles' conclusion in practice for simple few-parameter forms
for f, and found in all cases that the effective bias varied rather weakly with
scale. The APM results thus are either inconsistent with a CDM universe,
or require non-local bias.

A puzzle with regard to this conclusion is provided by the work of Jing,
Mo & Börner (1998). They evaluated the projected real-space correlations
for the LCRS survey (see figure 10). This statistic also fails to match the
prediction of CDM models, but this can be amended by introducing a
simple antibias scheme, in which galaxy formation is suppressed in the most
massive haloes. This scheme should in practice be very similar to the Mann,
Peacock & Heavens recipe of a simple weighting of particles as a function
of the local density; indeed, the main effect is a change of amplitude, rather
than shape of the correlations. The puzzle is this: if the APM power
spectrum is used to predict the projected correlation function, the result agrees
almost exactly with the LCRS. Either projected correlations are a rather
insensitive statistic, or perhaps the Baugh & Efstathiou deconvolution
procedure used to get P(k) has exaggerated the significance of features in
the spectrum. The LCRS results are one reason for treating the apparent
conflict between APM and CDM with caution.

Figure 10. The projected correlation function from the LCRS fails to match CDM
models when comparison is made to just the mass distribution. However, the agreement
is excellent when allowance is made for a small degree of scale-dependent antibias; galaxy
formation is suppressed in the most massive haloes (Jing, Mo & Börner 1998).

7.2. HALO CORRELATIONS

In reality, bias is unlikely to be completely causal, and this has led some
workers to explore stochastic bias models, in which

$$\rho_{\rm light} = f(\rho_{\rm mass}) + \epsilon, \qquad (52)$$

where ε is a random field that is uncorrelated with the mass density (Pen
1998; Dekel & Lahav 1999). Although truly stochastic effects are possible
in galaxy formation, a relation of the above form is expected when the
galaxy and mass densities are filtered on some scale (as they always are,
in practice). Just averaging a galaxy density that is a nonlinear function of
the mass will lead to some scatter when comparing with the averaged mass
field; a scatter will also arise when the relation between mass and light is
non-local, however, and this may be the dominant effect.
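
The way in which filtering alone generates an apparent stochastic term can be seen in a toy calculation. The sketch below makes several assumptions that are not taken from the text: a one-dimensional field of independent lognormal cells stands in for the mass density, the local relation f is a hypothetical power law, and filtering is a simple block average.

```python
import random
import statistics


def local_bias(rho_mass, power=2.0):
    """Hypothetical deterministic local relation, rho_light = rho_mass**power."""
    return rho_mass ** power


def block_average(values, width):
    """Average consecutive blocks of `width` cells (a crude top-hat filter)."""
    return [
        sum(values[i:i + width]) / width
        for i in range(0, len(values) - width + 1, width)
    ]


rng = random.Random(42)
mass = [rng.lognormvariate(0.0, 0.5) for _ in range(4096)]
light = [local_bias(m) for m in mass]

mass_s = block_average(mass, 8)
light_s = block_average(light, 8)

# Residuals about the local relation, before and after filtering.
residual_raw = [l - local_bias(m) for l, m in zip(light, mass)]
residual_smooth = [l - local_bias(m) for l, m in zip(light_s, mass_s)]

scatter_raw = statistics.pstdev(residual_raw)        # no scatter: relation is exact
scatter_smooth = statistics.pstdev(residual_smooth)  # scatter induced by averaging
print(scatter_raw, scatter_smooth)
```

The filtered light field no longer follows f applied to the filtered mass, even though the unfiltered relation is exact. The residual depends on the unresolved variance inside each block, which plays the role of ε.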

The simplest and most important example of non-locality in the galaxy- 
formation process is to recognize that galaxies will generally form where 
there are galaxy-scale haloes of dark matter. In the past, it was generally 
believed that dissipative processes were critically involved in galaxy for- 
mation, since pure collisionless evolution would lead to the destruction of 
galaxy-scale haloes when they are absorbed into the creation of a larger- 
scale nonlinear system such as a group or cluster. However, it turns out 
that this overmerging problem was only an artefact of inadequate
resolution. When a simulation is carried out with ~ 10^6 particles in a rich cluster,
the cores of galaxy-scale haloes can still be identified after many crossing 
times (Ghigna et al. 1997). Furthermore, if catalogues of these 'sub-haloes' 
are created within a cosmological-sized simulation, their correlation
function is quite different from that of the mass, resembling the single power
law seen in galaxies (e.g. Klypin et al. 1999; Ma 1999). 

These are very important results, and they hold out the hope that many 
of the issues concerning where galaxies form in the cosmic density field can 
be settled within the domain of collisionless simulations. Dissipative physics 
will still be needed to understand in detail the star-formation history within 
a galaxy-scale halo. Nevertheless, the idea that there may be a one-to-one 
correspondence between galaxies and galaxy-scale dark-matter haloes is 
clearly an enormous simplification - and one that increases the chance of 
making robust predictions of the statistical properties of the galaxy popu- 
lation. 

7.3. NUMERICAL GALAXY FORMATION 

The formation of galaxies must be a non-local process to some extent. The 
modern paradigm was introduced by White & Rees (1978): galaxies form 
through the cooling of baryonic material in virialized haloes of dark matter. 
The virial radii of these systems are in excess of 0.1 Mpc, so there is the 
potential for large differences in the correlation properties of galaxies and 
dark matter on these scales. 

A number of studies have indicated that the observed galaxy correlations 
may indeed be reproduced by CDM models. The most direct approach is a 
numerical simulation that includes gas, and relevant dissipative processes. 
This is challenging, but just starting to be feasible with current computing 
power (Pearce et al. 1999). The alternative is 'semianalytic' modelling, in 
which the merging history of dark-matter haloes is treated via the extended 
Press-Schechter theory (Bond et al. 1991), and the location of galaxies 
within haloes is estimated using dynamical-friction arguments (e.g. Cole et 
al. 1996; Kauffmann et al. 1996; Somerville & Primack 1997). Both these 
approaches have yielded similar conclusions, and shown how CDM models 
can match the galaxy data: specifically, the low-density flat ΛCDM model
that is favoured on other grounds can yield a correlation function that is
close to a single power law over 1000 > ξ > 1, even though the mass
correlations show a marked curvature over this range (Pearce et al. 1999; 
Benson et al. 1999; see figure 11). These results are impressive, yet it is 
frustrating to have a result of such fundamental importance emerge from 
a complicated calculational apparatus. There is thus some motivation for 
constructing a simpler heuristic model that captures the main processes at 
work in the full semianalytic models. The following section describes an 
approach of this sort (Peacock & Smith, in preparation).

Figure 11. The correlation function of galaxies in the semianalytical simulation of a
ΛCDM universe by Benson et al. (1999).



7.4. HALO-OLOGY AND BIAS 



We mentioned above the early model of Neyman, Scott & Shane (1953), 
in which the nonlinear density field was taken to be a superposition of 
randomly-placed clumps. With our present knowledge about the evolution 
of CDM universes, we can make this idealised model considerably more re- 
alistic: hierarchical models are expected to contain a distribution of masses 
of clumps, which have density profiles that are more complicated than 
isothermal spheres. These issues are well studied in N-body simulations,
and highly accurate fitting formulae exist, both for the mass function and
for the density profiles. Briefly, we use the mass function of Sheth & Tormen
(1999; ST) and the halo profiles of Moore et al. (1999; M99).

$$f(\nu) = 0.21617\,[1 + (\sqrt{2}/\nu^2)^{0.3}]\, \exp[-\nu^2/(2\sqrt{2})]$$
$$\Rightarrow\quad F(>\nu) = 0.32218\,[1 - {\rm erf}(\nu/2^{3/4})] + 0.14765\,\Gamma[0.2,\, \nu^2/(2\sqrt{2})], \qquad (53)$$

where Γ is the incomplete gamma function.
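
The consistency of the differential and integrated forms of the mass function can be checked numerically. The sketch below takes the coefficients as quoted in equation (53) and assumes that F(>ν) is the integral of f from ν to infinity. The incomplete gamma function is evaluated by brute-force quadrature purely to keep the example within the Python standard library, and the helper names are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)


def simpson(func, a, b, n=2000):
    """Composite Simpson's rule on [a, b] with n (even) intervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def st_f(nu):
    """Differential form f(nu), with the coefficients quoted in the text."""
    return (0.21617 * (1.0 + (SQRT2 / nu ** 2) ** 0.3)
            * math.exp(-nu ** 2 / (2.0 * SQRT2)))


def upper_incomplete_gamma(s, x, span=50.0):
    """Gamma(s, x): integral of t**(s-1) * exp(-t) from x upwards, for x > 0."""
    return simpson(lambda t: t ** (s - 1.0) * math.exp(-t), x, x + span, n=20000)


def st_F_closed(nu):
    """Integrated form F(>nu), with the coefficients quoted in the text."""
    return (0.32218 * (1.0 - math.erf(nu / 2.0 ** 0.75))
            + 0.14765 * upper_incomplete_gamma(0.2, nu ** 2 / (2.0 * SQRT2)))


def st_F_numeric(nu, nu_max=12.0):
    """F(>nu) obtained by integrating f directly; the tail beyond nu_max is negligible."""
    return simpson(st_f, nu, nu_max)


for nu in (1.0, 2.0):
    print(nu, st_F_closed(nu), st_F_numeric(nu))
```

The two evaluations should agree to about the precision of the quoted five-figure coefficients.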

Recently, it has been claimed by Moore et al. (1999; M99) that the 
commonly-adopted density profile of Navarro, Frenk & White (1996; NFW)
is in error at small r. M99 proposed the alternative form 



Using this model, it is then possible to calculate the correlations of the 
nonlinear density field, neglecting only the large-scale correlations in halo 
positions. The power spectrum determined in this way is shown in figure 
12, and turns out to agree very well with the exact nonlinear result on small 
and intermediate scales. The lesson here is that a good deal of the nonlinear 
correlations of the dark matter field can be understood as a distribution of 
random clumps, provided these are given the correct distribution of masses 
and mass-dependent density profiles. 

How can we extend this model to understand how the clustering of 
galaxies can differ from that of the mass? There are two distinct ways in 
which a degree of bias is inevitable: 

(1) Halo occupation numbers. For low-mass haloes, the probability of 
obtaining an L* galaxy must fall to zero. For haloes with mass above 
this lower limit, the number of galaxies will in general not scale with 
halo mass. 

(2) Nonlocality. Galaxies can orbit within their host haloes, so the prob- 
ability of forming a galaxy depends on the overall halo properties, 
not just the density at a point. Also, the galaxies will end up at spe- 
cial places within the haloes: for a halo containing only one galaxy, 
the galaxy will clearly mark the halo centre. In general, we expect 
one central galaxy and a number of satellites. 

The number of galaxies that form in a halo of a given mass is the
prime quantity that numerical models of galaxy formation aim to
calculate. However, for a given assumed background cosmology, the answer may
be determined empirically. Galaxy redshift surveys have been analyzed via
grouping algorithms similar to the 'friends-of-friends' method widely
employed to find virialized clumps in N-body simulations. With an
appropriate correction for the survey limiting magnitude, the observed number of
galaxies in a group can be converted to an estimate of the total stellar
luminosity in a group. This allows a determination of the All Galaxy System
(AGS) luminosity function: the distribution of virialized clumps of galaxies
as a function of their total luminosity, from small systems like the Local
Group to rich Abell clusters.

Figure 12. The power spectrum for the ΛCDM model. The solid lines contrast the
linear spectrum with the nonlinear spectrum, calculated according to the approximation
of PD96. The spectrum according to randomly-placed haloes is denoted by open circles;
if the linear power spectrum is added, the main features of the nonlinear spectrum are
well reproduced.

The AGS function for the CfA survey was investigated by Moore, Frenk
& White (1993), who found that the result in blue light was well described
by

$$d\Phi = \Phi^* \left[ (L/L^*)^\beta + (L/L^*)^\gamma \right]^{-1} dL/L^*, \qquad (55)$$

where Φ* = 0.00126 h^3 Mpc^-3, β = 1.34, γ = 2.89; the characteristic
luminosity is M* = −21.42 + 5 log10 h in Zwicky magnitudes, corresponding to
M*_B = −21.71 + 5 log10 h, or L* = 7.6 × 10^10 h^-2 L_⊙, assuming M_⊙ = 5.48.

Figure 13. The empirical luminosity-mass relation required to reconcile the observed
AGS luminosity function with two variants of CDM. L* is the characteristic luminosity in
the AGS luminosity function (L* = 7.6 × 10^10 h^-2 L_⊙). Note the rather flat slope around
M = 10^13 to 10^14 h^-1 M_⊙, especially for ΛCDM.

One notable feature of this function is that it is rather flat at low
luminosities, in contrast to the mass function of dark-matter haloes (see Sheth
& Tormen 1999). It is therefore clear that any fictitious galaxy catalogue
generated by randomly sampling the mass is unlikely to be a good match 
to observation. The simplest cure for this deficiency is to assume that the 
stellar luminosity per virialized halo is a monotonic, but nonlinear, function 
of halo mass. The required luminosity-mass relation is then easily deduced 
by finding the luminosity at which the integrated AGS density Φ(> L)
matches the integrated number density of haloes with mass > M. The 
result is shown in figure 13. 
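
The matching of integrated abundances can be sketched as follows. The AGS parameters are those quoted above, with the double power law assumed to enter as an inverse. The cumulative halo abundance `toy_halo_cumulative` is a purely hypothetical placeholder rather than the Sheth & Tormen function for any particular cosmology, so the output illustrates the method only and does not reproduce figure 13.

```python
import math

PHI_STAR = 0.00126        # h^3 Mpc^-3, as quoted for the AGS function
BETA, GAMMA = 1.34, 2.89  # faint-end and bright-end indices


def ags_cumulative(x, x_max=1.0e4, n=2000):
    """Phi(>L) for x = L/L*, integrating in ln(x) with Simpson's rule."""
    a, b = math.log(x), math.log(x_max)
    h = (b - a) / n

    def integrand(u):
        t = math.exp(u)
        return t / (t ** BETA + t ** GAMMA)

    total = integrand(a) + integrand(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(a + i * h)
    return PHI_STAR * total * h / 3.0


def toy_halo_cumulative(mass):
    """HYPOTHETICAL n(>M) in h^3 Mpc^-3; not a fit to any simulation."""
    return 1.0e-3 * (mass / 1.0e13) ** -0.9 * math.exp(-mass / 3.0e14)


def luminosity_for_mass(mass, lo=1.0e-4, hi=1.0e3):
    """Solve Phi(>L) = n(>M) for L/L* by bisection in log luminosity."""
    target = toy_halo_cumulative(mass)
    if not ags_cumulative(hi) < target < ags_cumulative(lo):
        raise ValueError("target abundance lies outside the bracketed range")
    for _ in range(60):
        mid = math.sqrt(lo * hi)
        if ags_cumulative(mid) > target:
            lo = mid  # too many systems brighter than mid: move to higher L
        else:
            hi = mid
    return math.sqrt(lo * hi)


for mass in (1.0e12, 1.0e13, 1.0e14):
    print(mass, luminosity_for_mass(mass))
```

Because both cumulative functions decline monotonically, the matched L(M) is monotonic by construction, which is the assumption made in the text.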

Figure 14. The power spectrum for a galaxy catalogue constructed from the ΛCDM
model. A reasonable agreement with the APM data (solid line) is achieved by simple
empirical adjustment of the occupation number of galaxies as a function of halo mass,
plus a scheme for placing the galaxies non-randomly within the haloes.

We can now return to the halo-based galaxy power spectrum and use
the correct occupation number, N, as a function of mass. This needs
a little care at small numbers, however, since the number of haloes with
occupation number unity affects the correlation properties strongly. These
haloes contribute no correlated pairs, so they simply dilute the signal from
the haloes with N ≥ 2. The existence of antibias on intermediate scales can
probably be traced to the fact that a large fraction of galaxy groups contain
only one > L* galaxy. Finally, we need to put the galaxies in the correct
location, as discussed above. If one galaxy always occupies the halo centre,
with others acting as satellites, the small-scale correlations automatically
follow the slope of the halo density profile, which keeps them steep. The
results of this exercise are shown in figure 14.
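
The dilution caused by singly-occupied haloes is easy to quantify by counting pairs. The sketch below assumes that only pairs within the same halo count as correlated, in the spirit of the randomly-placed clump model. The occupation lists are invented for illustration and the function name is hypothetical.

```python
def intra_halo_pair_fraction(occupations):
    """Fraction of all galaxy pairs whose two members share a halo.

    A halo holding N galaxies supplies N (N - 1) / 2 pairs, so haloes
    with N = 1 add galaxies to the catalogue but no correlated pairs.
    """
    galaxies = sum(occupations)
    if galaxies < 2:
        return 0.0
    same_halo = sum(n * (n - 1) // 2 for n in occupations)
    all_pairs = galaxies * (galaxies - 1) // 2
    return same_halo / all_pairs


# One triple system alone, then the same system plus three isolated galaxies.
print(intra_halo_pair_fraction([3]))
print(intra_halo_pair_fraction([1, 1, 1, 3]))
```

Adding the isolated galaxies leaves the number of correlated pairs unchanged while increasing the total pair count, which lowers the measured clustering amplitude.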

Although it is encouraging that simple models can account for the
observed correlation properties of galaxies, there are other longstanding
puzzles concerning the galaxy distribution. Arguably the chief of these
concerns the dynamical properties of galaxies, in particular the pairwise
peculiar velocity dispersion. This statistic has been the subject of debate,
and preferred values have crept up in recent years, to perhaps 450 or
500 km s^-1 at projected separations around 1 Mpc (e.g. Jing, Mo & Börner
1998); most simple models predict a higher figure. Clearly, the amplitude of
peculiar velocities depends on the normalization of the fluctuation spectrum;
however, if this is set from the abundance of rich clusters, then Jenkins et
al. (1998) found that reasonable values were predicted for large-scale
streaming velocities, independent of Ω. However, Jenkins et al. also found a
robust prediction for the pairwise peculiar velocity dispersion around 1 Mpc
of about 800 km s^-1. The observed galaxy velocity field appears to have a
higher 'cosmic Mach number' than the predicted dark-matter distribution.

This difficulty is also solved by the simple bias model discussed here. 
Two factors contribute: the variation of occupation number with mass 
downweights the contribution of more massive groups, with larger velocity 
dispersions. Also, where one galaxy is centred on a halo, it gains a peculiar 
velocity which is that of the centre of mass of the halo, but does not reflect 
the internal velocity dispersion of the halo. Given a full N-body
simulation, it is easy enough to predict what would be expected for a realistic
bias model: one needs to construct a halo catalogue, calculating the
peculiar velocities and internal velocity dispersions of each halo. Knowing the
occupation number as a function of mass, a Monte Carlo catalogue of
'galaxies' complete with peculiar velocities can be generated. As shown in figure
15, the effect of the empirical bias recipe advocated here is sufficient to re- 
duce the predicted dispersion into agreement with observation. The simple 
model outlined here thus gives a consistent picture, and it is tempting to 
believe that it may capture some of the main features of realistic models 
for galaxy bias. 
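
The Monte Carlo step described above might be sketched as follows. The details are assumptions made for illustration: each halo is reduced to a line-of-sight bulk velocity, an internal one-dimensional dispersion and an occupation number, satellite velocities are drawn from a Gaussian, and the function name and input values are hypothetical.

```python
import random


def galaxy_velocities(haloes, central=True, seed=0):
    """Assign line-of-sight peculiar velocities (km/s) to mock galaxies.

    haloes: iterable of (v_halo, sigma_internal, n_galaxies).
    With central=True the first galaxy in each halo moves with the halo
    centre of mass; every other galaxy also receives a random internal
    velocity drawn from a Gaussian of width sigma_internal.
    """
    rng = random.Random(seed)
    velocities = []
    for v_halo, sigma_internal, n_galaxies in haloes:
        for k in range(n_galaxies):
            if central and k == 0:
                velocities.append(v_halo)
            else:
                velocities.append(v_halo + rng.gauss(0.0, sigma_internal))
    return velocities


# A singly-occupied massive halo and a small group (invented numbers).
haloes = [(100.0, 500.0, 1), (-50.0, 300.0, 3)]
with_central = galaxy_velocities(haloes, central=True)
without_central = galaxy_velocities(haloes, central=False)
print(with_central)
print(without_central)
```

With a central galaxy, a singly-occupied halo contributes nothing from its internal dispersion to the pairwise statistic, whatever its mass. This is the second of the two effects described in the text.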

8. Conclusions 

It should be clear from these lectures that large-scale structure has ad- 
vanced enormously as a field in the past two decades. Many of our long- 
standing ambitions have been realised; in some cases, much faster than we 
might have expected. Of course, solutions for old problems generate new 
difficulties. We now have good measurements of the clustering spectrum 
and its evolution, and it is arguable that the discussion of section 7.4 cap- 
tures the main features of the placement of galaxies with respect to the 
mass. However, a fairly safe bet is that one of the major results from new 
large surveys such as 2dF and Sloan will be a heightened appreciation of 
the subtleties of this problem. 

Nevertheless, we should not be depressed if problems remain. Observa- 
tionally, we are moving from an era of 20% - 50% accuracy in measures of 
large-scale structure to a future of pinpoint precision. This maturing of the 
subject will demand more careful analysis and rejection of some of our ex- 
isting tools and habits of working. The prize for rising to this challenge will 
be the ability to claim a real understanding of the development of structure 
in the universe. We are not there yet, but there is a real prospect that the 
next 5-10 years may see this remarkable goal achieved. 

Figure 15. The line-of-sight pairwise velocity dispersion for the ΛCDM model. The top
curve shows the results for all the mass; the lower pair of curves shows the predicted
galaxy results, with and without assuming that one galaxy occupies the halo centre (the
former case gives the lowest curve).